Lesson: Performing Post-Match Processing 


J. Smith is in table B. These are really two different people, yet you have the same unique 
ID assigned to both of them as though they were the same person. As you do a more 
comprehensive match process between these two tables, you discover that these two 
rows are not duplicates at all, yet they have the same unique ID. The Split option splits 
these two rows into different data collections, keeping the existing unique ID on one of the 
records and assigning a new unique ID to the other. 


Assigning Unique IDs 


When specifying which numbers to use for unique IDs, the following methods are available: 


e Usea file of your own to assign a sequential number to records beginning at any number 
you choose. 


e Manually enter a starting unique ID value. Send the starting unique ID through a field in 
your data source created using the Query transform. The starting unique ID is passed to 
the Match transform before the first new unique ID is requested. If no unique ID is 
received, the starting number will default to 1. 


Use caution when using the Field option. The field that you use must contain the unique ID 
value you want to use to begin the sequential numbering. This means that each record you 
process must contain this field, and every record must have the same value in this field. 


e Write unique IDs that are dropped during delete processing back to a file to be used later. 


e You can recycle your own IDs by entering them in a file using the XML tag <R></R> as 
shown in the following example: 

<UniquelDSession> 

<CurrentUniquelID>477</CurrentUniquelID> 

<R>599</R> 

<R>814</R> 

</UniqueIDSession> 


Match Reports 
There are many match reports available to help you analyze your match results. 


e Match Contribution—The Match Contribution report provides information on the effect of 
the individual break groups and individual criteria on the total matching. Evaluation of this 
information is helpful for fine-tuning break key and match criteria. 


e Match Criteria—Summary Data Services generates one Match Criteria Summary report 
per match set to provide a consolidated view of all key settings and the criteria settings. 
You can evaluate this information to determine whether adjustment of field comparison 
lengths or criteria settings would be helpful. 


e Match Source—Statistics Summary The Match Source Statistics report provides 
information about duplicates within and across sources. 


e Match Duplicate—Sample The Match Duplicate Sample report provides a sample of 
duplicates in the match results. One report is generated for each Match transform in the 
job. 


E LESSON SUMMARY 
You should now be able to: 


e Perform post-match processing 





© Copyright. All rights reserved. 121 SAPA 
® 


